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Bengali language 


Bengali /b£i]' cp : li/,^ also known by its endonym Bangla 
(^T°^TT [' baqlaj), is an Indo-Aryan language primarily spoken by 
the Bengalis in South Asia, specifically in the eastern part of the 
Indian subcontinent, native to Bangladesh and the Indian state of 
West Bengal. There are also a significant number of Bengali 
speakers in the Indian state of Tripura and Assam's Barak Valley. 
It is the official and most widely spoken language of Bangladesh 
and second most widely spoken of the 22 scheduled languages of 
India, behind Hindi. With approximately 228 million native 
speakers and another 37 million as second language 
speakers,Bengali is the fifth most-spoken native language 
and the seventh most spoken language by total number of 
speakers in the world.! 8 !! 9 ! 

The official and de facto national language of Bangladesh is 
Modern Standard BengaliJ 10 ^ 11 ^ 12 ^ It serves as the lingua 
franca of the nation, with 98% of Bangladeshis being fluent in 
Bengali as their first language.! ' ^ l4 ' Within India, Bengali is the 
official language of the states of West Bengal, Tripura and the 
Barak Valley region of the state of Assam. It is the most widely 
spoken language in the Andaman and Nicobar Islands in the Bay 
of Bengal,! 15 ! and is spoken by significant populations in other 
states including in Delhi, Chhattisgarh and Jharkhand 
Bengali is also spoken by the significant global Bengali diaspora 
(Bangladeshi diaspora and Indian Bengalis) communities in 
Pakistan, the United Kingdom, the United States, and the Middle 
East.! 17 ! 

Bengali has developed over the course of more than 1,300 years. 
Bengali literature, with its millennium-old literary history, has 
extensively developed since the Bengali Renaissance and is one 
of the most prolific and diverse literary traditions in Asia. The 
Bengali language movement from 1948 to 1956 demanding 
Bengali to be an official language of Pakistan fostered Bengali 
nationalism in East Bengal leading to the emergence of 
Bangladesh in 1971. In 1999, UNESCO recognised 21 February 
as International Mother Language Day in recognition of the 
language movement.! 18 ^ 19 ^ The Bengali language is the 
quintessential element of Bengali identity and binds together a 
culturally diverse region. 
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History 


Ancient languages of Bengal 

Sanskrit was practised by Hindu Brahmins in Bengal since the 
first millennium BCE. However, the local Buddhists were 
speaking in some varieties of the Prakrita languages, coined by 
Dr. Suniti Kumar Chatterjee as an "eastern variety of Magadhi 
Prakrit". During the Gupta Empire, Bengal was a hub of Sanskrit 
literature for Hindu priests, which influenced the vernacular 
spoken by the localsThe Middle Indo-Aryan dialects were 
influential in Bengal in the first millennium when the region was 
a part of the Magadha Realm. These dialects were called 
Magadhi Prakrit and was spoken in modern-day Bihar, west of 

Bengal, and eventually evolved into Ardha MagadhiJ 21 Ardha Magadhi began to give way to what is 
known as Apabhralllsa, at the end of the first millennium. The Bengali language evolved as a distinct 
language by the course of timeJ 23 ^ 
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Silver coin with proto-Bengali script, 
Harikela Kingdom, circa 9th-13th 
century 


Early 

Along with other Eastern 
Indo-Aryan languages, 

Bengali evolved circa 1000- 
1200 CE from Sanskrit and 
Magadhi Prakrit The 
local Apabhraltlsa of the 
eastern subcontinent, Purbi 
Apabhraltlsa or Abahatta 
("Meaningless Sounds"), 
eventually evolved into 
regional dialects, which in 
turn formed three groups of 
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Play media 

A series of picture stories narrated 
in Bengali 


the Bengali-Assamese languages, the Bihari languages, and the Odia 
language . Some argue that the points of divergence occurred much 
earlier - going back to even 500, many argue the literature was lost 
during the medieval times, resulting in many words adopting silent letters. Including many vegetables.^ 4 but 
the language was not static: different varieties coexisted and authors often wrote in multiple dialects in this 
period. For example, Ardhamagadhi is believed to have evolved into Abahatta around the 6th century, which 
competed with the ancestor of Bengali for some timeJ 26 ^ Proto-Bengali was the language of the Pala Empire 
and the Sena dynastyJ 27 ^ 28 ^ 


Medieval 



Silver Taka from the Sultanate of 
Bengal, circa 1417 


During the medieval period, Middle Bengali was characterised by the 
elision of word-final tsj b, the spread of compound verbs and Arabic 
and Persian influences. Bengali was an official court language of the 
Sultanate of Bengal. Muslim rulers promoted the literary 
development of BengaliJ 29 ^ Bengali became the most spoken 
vernacular language in the Sultanate This period saw borrowing 
of Perso-Arabic terms into Bengali vocabulary. Major texts of Middle 
Bengali (1400-1800) include Chandidas' Shreekrishna Kirtana. 


Modern 

The modern literary form of Bengali was developed during the 19th and early 20th centuries based on the 
dialect spoken in the Nadia region, a west-central Bengali dialect. Bengali presents a strong case of diglossia, 
with the literary and standard form differing greatly from the colloquial speech of the regions that identify 
with the language. 431 ' The modern Bengali vocabulary contains the vocabulary base from Magadhi Prakrit 
and Pali, also tatsamas and reborrowings from Sanskrit and other major borrowings from Persian, Arabic, 
Austroasiatic languages and other languages in contact with. 

During this period, there were two main forms of written Bengali: 

■ Cholitobhasha ; colloquial form of Bengali using simplified inflections 

■ Sadhubhasha\ Sanskritised form of Bengali.^ 































In 1948 the Government of Pakistan tried to impose Urdu as the sole 
state language in Pakistan, starting the Bengali language movement 
The Bengali Language Movement was a popular ethno-linguistic 
movement in the former East Bengal (today Bangladesh), which was a 
result of the strong linguistic consciousness of the Bengalis to gain and 
protect spoken and written Bengali's recognition as a state language of 
the then Dominion of Pakistan. On 21 February 1952 five students and 
political activists were killed during protests near the campus of the 
University of Dhaka. In 1956 Bengali was made a state language of 
Pakistan^ 33 ] The day has since been observed as Language Movement 
Day in Bangladesh and is also commemorated as International Mother 
Language Day by UNESCO every year since 2000. 

In 2010, the parliament of Bangladesh and the legislative assembly of 
West Bengal proposed that Bengali be made an official UN 
language/ 34 ^ though no further action was taken on this matter. 



The Central Shaheed Minar in 
Dhaka, Bangladesh 
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Language Martyr's Memorial at 
Silchar Railway Station in 
Assam, India. 


Geographical distribution 


The Bengali language is native to the region of Bengal, which comprises 
Indian states of West Bengal and the present-day nation of Bangladesh. 

Besides the native region it is also spoken by the Bengalis living in 
Tripura, southern Assam and the Bengali population in the Indian union 
territory of Andaman and Nicobar Islands. Bengali is also spoken in the 
neighbouring states of Odisha, Bihar, and Jharkhand, and sizeable 
minorities of Bengali speakers reside in Indian cities outside Bengal, 
including Delhi, Mumbai, Thane, Varanasi, and Vrindavan. There are 
also significant Bengali-speaking communities in the Middle 
East/ 35 ^ 36 ^ 37 ^ the United States/ 38 ^ Singapore,^ 39 ^ Malaysia, Australia, 
Canada, the United Kingdom, and Italy. 


Official status 

The 3rd article of the Constitution of Bangladesh states Bengali to be the 
sole official language of Bangladeshi 12 ^ The Bengali Language 
Implementation Act, 1987 made it mandatory to use Bengali in all 
records and correspondences, laws, proceedings of court and other legal 
actions in all courts, government or semi-government offices, and 
autonomous institutions in Bangladeshi 10 ^ It is also the de facto national 
language of the country. 



Approximate distribution of 
native Bengali speakers 
(assuming a rounded total 
of 261 million) worldwide. 

| | Bangladesh (61.3%) 

| | India (37.2%) 

| | Other (1.5%) 


In India, Bengali is one of the 23 official languagesi 40 ^ It is the official language of the Indian states of West 
Bengal, Tripura and in Barak Valley of Assami 41 ^ 42 ^ Bengali is a second official language of the Indian state 
of Jharkhand since September 2011. It is also a recognised secondary language in the City of Karachi in 








































Pakistan, t 43 ^ 44 ^ 45 ! The Department of Bengali in the University of 
Karachi also offers regular programs of studies at the Bachelors and 
at the Masters levels for Bengali Literature.t 46 ] 

The national anthems of both Bangladesh (Amar Sonar Bangla ) and 
India (Jana Gana Mana ) were written in Bengali by the Bengali 
Nobel laureate Rabindranath TagoreJ 47 ' Additionally, the first two 
verses of Vande Mataram, a patriotic song written in Bengali by 
Bankim Chandra Chatterjee, was adopted as the "national song" of 
India in both the colonial period and later in 1950 in independent 
India. Furthermore, it is believed by many that the national anthem of 
Sri Lanka (Sri Lanka Matha) was inspired by a Bengali poem written 
by Rabindranath Tagore/ 48 ^ 49 ^ 50 ^ 51 ^ while some even believe the 
anthem was originally written in Bengali and then translated into 
SinhalaJ 52 ^ 53 ^ 54 ^ 55 ] 



A Bengali sign in Brick Lane in 
London, which is home to a large 
Bengali diaspora 


After the contribution made by the Bangladesh UN Peacekeeping Force in the Sierra Leone Civil War under 
the United Nations Mission in Sierra Leone, the government of Ahmad Tejan Kabbah declared Bengali an 
honorary language in December 2002_l 56 k 57 ][58][59] 


In 2009, elected representatives in both Bangladesh and West Bengal called for Bengali language to be made 
an official language of the United Nations 


Dialects 

Regional variation in spoken Bengali constitutes a dialect continuum. Linguist Suniti Kumar Chattopadhyay 
grouped these dialects into four large clusters - Rarh, Banga, Kamrupi and Varendra 481 ' but many alternative 
grouping schemes have also been proposedThe south-western dialects (Rarhi or Nadia dialect) form the 
basis of modern standard colloquial Bengali. In the dialects prevalent in much of eastern and south-eastern 
Bangladesh (Barisal, Chittagong, Dhaka and Sylhet Divisions of Bangladesh), many of the stops and 
affricates heard in West Bengal are pronounced as fricatives. Western alveolo-palatal affricates 15 [t£D], H 
[t£ h D], vSi [d£D] correspond to eastern F [tsD], ^ [ts h D~sD], [dzD~zD], The influence of Tibeto-Burman 
languages on the phonology of Eastern Bengali is seen through the lack of nasalised vowels and an alveolar 
articulation of what are categorised as the "cerebral" consonants (as opposed to the postalveolar articulation 
of West Bengal). Some variants of Bengali, particularly Chittagonian and Chakma, have contrastive tone; 
differences in the pitch of the speaker's voice can distinguish words. Rangpuri, Kharia Thar and Mai Paharia 
are closely related to Western Bengali dialects, but are typically classified as separate languages. Similarly, 
Hajong is considered a separate language, although it shares similarities to Northern Bengali dialects J 63 ^ 

During the standardisation of Bengali in the 19th century and early 20th century, the cultural center of Bengal 
was in the city of Kolkata, founded by the British. What is accepted as the standard form today in both West 
Bengal and Bangladesh is based on the West-Central dialect of Nadia District, located next to the border of 
Bangladesh.^ There are cases where speakers of Standard Bengali in West Bengal will use a different word 
from a speaker of Standard Bengali in Bangladesh, even though both words are of native Bengali descent. 
For example, the word salt is nun in the west which corresponds to ^RFf lobon in the east^ 65 ^ 

Spoken and literary varieties 


Bengali exhibits diglossia, though some scholars have proposed triglossia or even n-glossia or heteroglossia 
between the written and spoken forms of the languageJ 31 ' Two styles of writing have emerged, involving 
somewhat different vocabularies and syntax:I 64 k 66 l 
































1. Shadhu-bhasha (srr^M "uptight language") 
was the written language, with longer verb 
inflections and more of a Pali and Sanskrit- 
derived Tatsama vocabulary. Songs such as 
India's national anthem Jana Gana Mana (by 
Rabindranath Tagore) were composed in 
Shadhubhasha. However, use of 
Shadhubhasha in modern writing is uncommon, 
restricted to some official signs and documents 
in Bangladesh as well as for achieving particular 
literary effects. 

2. Cholito-bhasha (uPivo^PTf "running language"), 
known by linguists as Standard Colloquial 
Bengali, is a written Bengali style exhibiting a 
preponderance of colloquial idiom and 
shortened verb forms, and is the standard for 
written Bengali now. This form came into vogue 
towards the turn of the 19th century, promoted 
by the writings of Peary Chand Mitra ( Alaler 
Gharer Dulal, 1857),t 67 ! Pramatha Chaudhuri 
(Sabujpatra, 1914) and in the later writings of 
Rabindranath Tagore. It is modelled on the 
dialect spoken in the Shantipur region in Nadia 
district, West Bengal. This form of Bengali is 
often referred to as the "Nadia standard", 

"Nadia dialect", "Southwestern/West-Central 
dialect" or "Shantipuri Bangla"J 62 ] 

Linguist Prabhat Ranjan Sarkar categorises the language 


■ Madhya Radhi dialect 

■ Kanthi (Contai) dialect 

■ Kolkata dialect 

■ Shantipuri (Nadia) dialect 

■ Maldahiya (Jangipuri) dialect 

■ Barendri dialect 

■ Rangpuriya dialect 

■ Sylheti dialect 

■ Dhakaiya (Bikrampuri) dialect 

■ Jessor/Jessoriya dialect 

■ Barisal (Chandradwip) dialect 

■ Chattal (Chittagong) dialect 


A map of Bengal (and some districts of Assam 
and Jharkhand) which shows the dialects of the 
Bengali Language. 

|P | Bangali dialect 

P | Manbhumi dialect 

| | Varendri dialect 

| | Rarhi dialect 

[ | Sundarbani dialect 

| | *Rajbanshi dialect/language 

| | *Chittagonian dialect/language 

| | *Sylheti dialect/language 

(the name with *bold letter are considered as 

Bengali dialects, also sometimes as separate 

languages.) 


















| | Central Rarh Bengali (compare Sen's 

Jharkhandi dialect) 

[ | Contai dialect 

| | Calcutta Bengali (compare Sundarbani 

dialect) 

| | Nadia Bengali (compare Chatterji's Eastern 

Rarhi dialect) 

| | Maldaiya Bengali 

| | Varendri Bengali (compare Chatterji's 

Varendri dialect) 

[ | Rangpuri Bengali (compare Rangpuri 

language) 

| | Sylheti Bengali (compare Sylheti language) 

| | Vikrampuri Bengali 

| | Jessore Bengali 

[ | Chandradwipi Bengali 

[ | Chittagong Bengali (compare Chittagonian 

language) 


While most writing is in Standard Colloquial Bengali (SCB), spoken dialects exhibit a greater variety. People 
in southeastern West Bengal, including Kolkata, speak in SCB. Other dialects, with minor variations from 
Standard Colloquial, are used in other parts of West Bengal and western Bangladesh, such as the Midnapore 
dialect, characterised by some unique words and constructions. However, a majority in Bangladesh speak in 
dialects notably different from SCB. Some dialects, particularly those of the Chittagong region, bear only a 
superficial resemblance to SCB.^ 68 ^ The dialect in the Chittagong region is least widely understood by the 
general body of Bengalis The majority of Bengalis are able to communicate in more than one variety - 
often, speakers are fluent in Cholitobhasha (SCB) and one or more regional dialects 

Even in SCB, the vocabulary may differ according to the speaker's religion: Muslims are more likely to use 
words of Persian and Arabic origin, along with more native words respectively whereas Hindus are more 
likely to use words derived from Sanskrit.^ For example^ 65 ] 





Predominantly Hindu usage 

Predominantly Muslim usage 

Translation 

nomoshkar 

Assalamu-Alaikum 

hello 

fwt nimontron 

m\3?iT\o dawat 

invitation 

’SwT jol 

pani 

water 

WT?T snan 

C’ttmf gosol 

bath 

fnfpf didi 

mt^apu 

sister / elder sister 

mm dada 

bhai 

brother / elder brother! 70 ] 

mashi 

mmr khala 

maternal aunt 

pishi 

phuphu 

paternal aunt 

rntmt kaka 

mm chacha 

paternal uncle 

sft^m prarthona 

dua 

prayer 

prodeep 

mfib bati 

light 


Phonology 


The phonemic inventory of standard Bengali consists of 29 consonants and 7 vowels, as well as 7 nasalised 
vowels. The inventory is set out below in the International Phonetic Alphabet (upper grapheme in each box) 
and romanisation (lower grapheme). 


Vowels Nasalized vowels 



Front 

Central 

Back 


Front 

Central 

Back 

Close 

i 

i 


u 

u 

Close 

T 

T 


u 

u 



Close-mid 

e 

e 


\3 

0 

0 

Close-mid 

e 

e 


\3 

6 

6 



Open-mid 

mjr 

£ 

e 


m 

3 

6 

Open-mid 

%/mjr 

£ 


3 

Open 


mf 

a 


Open 


mr 

a 

a 







Consonants 



Labial 

Dental/ 

Alveolar 

Retroflex 

Palatoalveolar 

Velar 

Glottal 

Nasal 

m 

n 


Q 


Plosive 

voiceless 

unaspirated 

P 

t 

I 

tX 

k 


aspirated 

p h ~f 

t h 

t h 

tJ h 

k h 


voiced 

unaspirated 

b 

d 

4 

43 

9 


aspirated 

> 

l 

_Q 

d h 

4 h 

d3 h 

9 h 


Fricative 


s 

I 


h 

Approximant 

(w) 

1 


G) 



Rhotic 


r~r 

r~r h 





Bengali is known for its wide variety of diphthongs, combinations of vowels occurring within the same 
syllable.Two of these, /oi/ and /ou/, are the only ones with representation in script, as ^ and 
respectively, /e i O U/ may all form the glide part of a diphthong. The total number of diphthongs is not 
established, with bounds at 17 and 31. An incomplete chart is given by Sarkar (1985) of the following: 



§ 

i 

9 

U 

a 

ae 

ai 

ao 

au 

ae 

aee 


aeo 


e 


ei 


eu 

i 


ii 


iu 

o 

oe 

oi 

OQ 

ou 

u 


ui 




Stress 

In standard Bengali, stress is predominantly initial. Bengali words are virtually all trochaic; the primary 
stress falls on the initial syllable of the word, while secondary stress often falls on all odd-numbered syllables 
thereafter, giving strings such as in ^ Q.C^J rf^TvST sho-ho-jo-gi-ta "cooperation", where the boldface represents 
primary and secondary stress. 


Consonant clusters 

Native Bengali words do not allow initial consonant clusters;^ the maximum syllabic structure is CVC (i.e. 
one vowel flanked by a consonant on each side). Many speakers of Bengali restrict their phonology to this 
pattern, even when using Sanskrit or English borrowings, such as geram (CV.CVC) for £[T?T gram 

(CCVC) "village" or^pT iskul (VC.CVC) for^T skul (CCVC) "school". 


Writing system 





The Bengali script is an abugida, a script with letters for consonants, 
diacritics for vowels, and in which an inherent vowel (\5f 6) is 
assumed for consonants if no vowel is marked.^ The Bengali 
alphabet is used throughout Bangladesh and eastern India (Assam, 

West Bengal, Tripura). The Bengali alphabet is believed to have 
evolved from a modified Brahmic script around 1000 CE (or 10th- 
11th century).Note that despite Bangladesh being majority 
Muslim, it uses the Bengali alphabet rather than an Arabic-based one 
like the Shahmukhi script used in Pakistan. However, throughout 
history there have been instances of the Bengali language being 
written in Perso-Arabic. The use of the Sylheti Nagari script also 
emerged in the Sylhet region of the BengalJ 11 ^ 

The Bengali script is a cursive script with eleven graphemes or signs 
denoting nine vowels and two diphthongs , and thirty-nine graphemes 
representing consonants and other modifiers.t 75 ] There are no distinct 
upper and lower case letter forms. The letters run from left to right 
and spaces are used to separate orthographic words. Bengali script 
has a distinctive horizontal line running along the tops of the 
graphemes that links them together called HTSiT matraP® 

Since the Bengali script is an abugida, its consonant graphemes 
usually do not represent phonetic segments , but carry an "inherent" 
vowel and thus are syllabic in nature. The inherent vowel is usually a 
back vowel, either [D] as in [m3t] "opinion" or [o], as in 
[mon] "mind", with variants like the more open [d]. To emphatically 
represent a consonant sound without any inherent vowel attached to 
it, a special diacritic, called the hosonto (T), may be added below the 
basic consonant grapheme (as in H [m]). This diacritic, however, is 
not common, and is chiefly employed as a guide to pronunciation. 

The abugida nature of Bengali consonant graphemes is not consistent, however. Often, syllable-final 
consonant graphemes, though not marked by a hosonto, may carry no inherent vowel sound (as in the final H 
in [mon] or the medial H in NImnT [gamla]). 

A consonant sound followed by some vowel sound other than the inherent [D] is orthographically realised by 
using a variety of vowel allographs above, below, before, after, or around the consonant sign, thus forming 
the ubiquitous consonant-vowel typographic ligatures . These allographs, called kar, are diacritical vowel 
forms and cannot stand on their own. For example, the graph [mi] represents the consonant [m] followed 
by the vowel [i], where [i] is represented as the diacritical allograph^ (called ^-^T?T i-kar ) and is placed 
before the default consonant sign. Similarly, the graphs [ma], [mi], ^ [mu], ^ [mu], ^ [mri], CH 

[me~m£], (!?T [moj], CHT [mo] and GTl [mow] represent the same consonant H combined with seven other 
vowels and two diphthongs. In these consonant-vowel ligatures, the so-called "inherent" vowel [D] is first 
expunged from the consonant before adding the vowel, but this intermediate expulsion of the inherent vowel 
is not indicated in any visual manner on the basic consonant sign H [m3]. 

The vowel graphemes in Bengali can take two forms: the independent form found in the basic inventory of 
the script and the dependent, abridged, allograph form (as discussed above). To represent a vowel in isolation 
from any preceding or following consonant, the independent form of the vowel is used. For example, in TRt 
[moj] "ladder" and in RfmN [ilij] "Hilsa fish", the independent form of the vowel ^ is used (cf. the dependent 
formf«). A vowel at the beginning of a word is always realised using its independent form. 


(wr ^ 

^MWrarnrriir 

srftfY' 

IPS 

vYfrtrffir n 

Triy UKt. LfuMUd 

SfuMt cf 

ftx. tfjuf. 

•i.'va'KT fZriAr Vlt} *YV?r73~ 

#£>yfr?3ri‘VSC 

x-> 

r>V/<r ftpr wav *$y^r 

7il wie*. 

■i* itvta cL/.idl*. 

CchcA. i/i votA ftcLASlCi 

JBMfS J 7 B ,Y ^ 

rfanrrQvr'frflr, 
sfM^-Jrfyr vryfr' i f 

tJii. httcJty JUa U 

■{*$**"*i, 

iWitlMlfm. hM trtrtttjZ JyaAs, 


An example of handwritten Bengali. 
Part of a poem written in Bengali 
(and with its English translation 
below each Bengali paragraph) by 
Nobel Laureate Rabindranath 
Tagore in 1926 in Hungary 



In addition to the inherent-vowel-suppressing hosonto, three more diacritics are commonly used in Bengali. 
These are the superposed chdndrdbindu (o), denoting a suprasegmental for nasalisation of vowels (as in FT 1 ! 
[t Jad] "moon"), the postposed onusbar (o°) indicating the velar nasal [q] (as in ^T\<TT [baqla] "Bengali") and 
the postposed bisorgo (og) indicating the voiceless glottal fricative [h] (as in [uh] "ouch!") or the 
gemination of the following consonant (as in ^§^1 [duk h : D] "sorrow"). 

The Bengali consonant clusters (^v3’<U'g??T juktobenjon ) are usually realised as ligatures, where the consonant 
which comes first is put on top of or to the left of the one that immediately follows. In these ligatures, the 
shapes of the constituent consonant signs are often contracted and sometimes even distorted beyond 
recognition. In the Bengali writing system, there are nearly 285 such ligatures denoting consonant clusters. 
Although there exist a few visual formulas to construct some of these ligatures, many of them have to be 
learned by rote. Recently, in a bid to lessen this burden on young learners, efforts have been made by 
educational institutions in the two main Bengali-speaking regions (West Bengal and Bangladesh) to address 
the opaque nature of many consonant clusters, and as a result, modern Bengali textbooks are beginning to 
contain more and more "transparent" graphical forms of consonant clusters, in which the constituent 
consonants of a cluster are readily apparent from the graphical form. However, since this change is not as 
widespread and is not being followed as uniformly in the rest of the Bengali printed literature, today's 
Bengali-learning children will possibly have to learn to recognise both the new "transparent" and the old 
"opaque" forms, which ultimately amounts to an increase in learning burden. 

Bengali punctuation marks, apart from the downstroke I dari - the Bengali equivalent of a full stop - have 
been adopted from western scripts and their usage is similar.^ 61 ! 

Unlike in western scripts (Latin, Cyrillic, etc.) where the letter-forms stand on an invisible baseline, the 
Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called HT3T matra. The 
presence and absence of this matra can be important. For example, the letter vo to and the numeral O "3" are 
distinguishable only by the presence or absence of the matra, as is the case between the consonant cluster tn 
tro and the independent vowel ^3 e. The letter-forms also employ the concepts of letter-width and letter- 
height (the vertical space between the visible matra and an invisible baseline). 

There is yet to be a uniform standard collating sequence (sorting order of graphemes to be used in 
dictionaries, indices, computer sorting programs, etc.) of Bengali graphemes. Experts in both Bangladesh and 
India are currently working towards a common solution for this problem. 


Orthographic depth 

The Bengali script in general has a comparatively shallow orthography, i.e., in most cases there is a one-to- 
one correspondence between the sounds (phonemes) and the letters (graphemes) of Bengali. But grapheme- 
phoneme inconsistencies do occur in certain cases. 

One kind of inconsistency is due to the presence of several letters in the script for the same sound. In spite of 
some modifications in the 19th century, the Bengali spelling system continues to be based on the one used for 
Sanskrit/ 61 ^ and thus does not take into account some sound mergers that have occurred in the spoken 
language. For example, there are three letters ( X T, and ^f) for the voiceless postalveolar fricative [J], 
although the letter ^ retains the voiceless alveolar sibilant [s] sound when used in certain consonant 
conjuncts as in [sk h Dlon] "fall", [spDndon] "beat", etc. The letter ^ also retains the voiceless 

retroflex sibilant [£] sound when used in certain consonant conjuncts as in [kD^D] "suffering", 

[go£t h i] "clan", etc. Similarly, there are two letters (tSf and ^T) for the voiced postalveolar affricate [d3]. 
Moreover, what was once pronounced and written as a retroflex nasal T [iq] is now pronounced as an alveolar 
[n] when in conversation (the difference is heard when reading) (unless conjoined with another retroflex 
consonant such as fe, &, 'S and TJ), although the spelling does not reflect this change. The open-mid front 



unrounded vowel [£] is orthographically realised by multiple means, as seen in the following examples: 
[£to] "so much", [£kademi] "academy", [Emiba] "amoeba", OT^fT [d£k h a] "to see", 

[b£sto] "busy", ^Tl<P><]°r [bEkorDn] "grammar". 


Another kind of inconsistency is concerned with the incomplete coverage of phonological information in the 
script. The inherent vowel attached to every consonant can be either [D] or [o] depending on vowel harmony 
(TBT>l^[v)) with the preceding or following vowel or on the context, but this phonological information is not 
captured by the script, creating ambiguity for the reader. Furthermore, the inherent vowel is often not 
pronounced at the end of a syllable, as in <T>M [kum] "less", but this omission is not generally reflected in the 
script, making it difficult for the new reader. 

Many consonant clusters have different sounds than their constituent consonants. For example, the 
combination of the consonants ^ [k] and ^ [ 5 ] is graphically realised as ^ and is pronounced [kk h D] (as in 
[rukk h o] "coarse") or [kk h o] (as in [kk h oti] "harm") or even [kk h D] (as in [kk h Dmota] 

"capability"), depending on the position of the cluster in a word. The Bengali writing system is, therefore, not 
always a true guide to pronunciation. 


Uses 

The script used for Bengali, Assamese and other languages is known as Bengali script. The script is known 
as the Bengali alphabet for Bengali and its dialects and the Assamese alphabet for Assamese language with 
some minor variations. Other related languages in the nearby region also make use of the Bengali alphabet 
like the Meitei language in the Indian state of Manipur, where the Meitei language has been written in the 
Bengali alphabet for centuries, though the Meitei script has been promoted in recent times. 


Romanisation 

There are various Romanisation systems used for Bengali created in recent years which have failed to 
represent the true Bengali phonetic sound. The Bengali alphabet has often been included with the group of 
Brahmic scripts for romanisation where the true phonetic value of Bengali is never represented. Some of 
them are the International Alphabet of Sanskrit Transliteration or IAST system (based on diacritics)/ 77 ^ 
"Indian languages Transliteration" or ITRANS (uses upper case letters suited for ASCII keyboards)/ 78 ^ and 
the National Library at Kolkata romanisation/ 79 ^ 

In the context of Bengali romanisation, it is important to distinguish transliteration from transcription. 
Transliteration is orthographically accurate (i.e. the original spelling can be recovered), whereas transcription 
is phonetically accurate (the pronunciation can be reproduced). 

Although it might be desirable to use a transliteration scheme where the original Bengali orthography is 
recoverable from the Latin text, Bengali words are currently Romanized on Wikipedia using a phonemic 
transcription, where the true phonetic pronunciation of Bengali is represented with no reference to how it is 
written. 

The most recent attempt has been by publishers Mitra and Ghosh with the launch of three popular children's 
books, Abol Tabol, Hasi Khusi and Sahoj Path in Roman script at the Kolkata Book Fair 2018. Published 
under the imprint of Benglish Books, these are based on phonetic transliteration and closely follow spellings 
used in social media but for using an underline to describe soft consonants. 


Grammar 



Bengali nouns are not assigned gender, which leads to minimal changing of adjectives (inflection). However, 
nouns and pronouns are moderately declined (altered depending on their function in a sentence) into four 
cases while verbs are heavily conjugated, and the verbs do not change form depending on the gender of the 
nouns. 


Word order 

As a head-final language, Bengali follows subject-object-verb word order, although variations to this theme 
are commonJ 80 ^ Bengali makes use of postpositions, as opposed to the prepositions used in English and 
other European languages. Determiners follow the noun, while numerals, adjectives, and possessors precede 
the noun. [81] 

Yes-no questions do not require any change to the basic word order; instead, the low (L) tone of the final 
syllable in the utterance is replaced with a falling (HL) tone. Additionally, optional particles (e.g. -ki, HT - 
na, etc.) are often encliticised onto the first or last word of a yes-no question. 

Wh-questions are formed by fronting the wh-word to focus position, which is typically the first or second 
word in the utterance. 


Nouns 

Nouns and pronouns are inflected for case, including nominative, objective, genitive (possessive), and 
locative.The case marking pattern for each noun being inflected depends on the noun's degree of 
animacy. When a definite article such as -T?T - ta (singular) or -'30 5 TT -gulo (plural) is added, as in the tables 
below, nouns are also inflected for number. 

In most of the Bengali grammar books, cases are divided into 6 categories and an additional possessive case 
(possessive form is not recognised as a type of case by Bengali grammarians). But in terms of usages, cases 
are generally grouped into only 4 categories. 


Singular noun inflection Plural noun inflection 



Animate 

Inanimate 


Animate 

Inanimate 

Nominative 


'Sjx>lth 
juta-ta 
the shoe 

Nominative 

chatro-ra 
the students 


^I^jId 

chatro-ti 

the 

student 

'Sjx>i'3c^rr/ 

'SjpoT'atTTT 

juta- 

gula/juto- 

gulo 

the shoes 

Objective 

chatro- 

ti-ke 

the 

student 

'Sjx>lt>T 
juta-ta 
the shoe 

Objective 

chatro-der(ke) 
the students 

'^ot'3^TT/ 

juta- 

gula/juto- 

gulo 

the shoes 

Genitive 

chatro- 

ti-r 

the 

student's 

’SpTtfra 
juta-ta-r 
the shoe's 

Genitive 

chatro-der 
the students' 

'SjpoT'StTTni 

juta- 

gula/juto- 
gulo-r 
the shoes' 

Locative 

- 

juta-ta-y 
on/in the 
shoe 

Locative 

- 





juta- 

gula/juto- 
gulo-te 
on/in the 
shoes 


When counted, nouns take one of a small set of measure words. Nouns in Bengali cannot be counted by 
adding the numeral directly adjacent to the noun. An appropriate measure word (MW), a classifier, must be 
used between the numeral and the noun (most languages of the Mainland Southeast Asia linguistic area are 
similar in this respect). Most nouns take the generic measure word -T?T -ta, though other measure words 
indicate semantic classes (e.g. -tSRT -jon for humans). There is also the classifier -khana, and its diminutive 
form -khani, which attach only to nouns denoting something flat, long, square, or thin. These are the least 
common of the classifiers.^ 82 ^ 


Measure words 


Bengali 

Bengali transliteration 

Literal translation 

English translation 


Noy-ta goru 

Nine-MW cow 

Nine cows 


Koy-ta balish 

How many-MW pillow 

How many pillows 


Onek-jon lok 

Many-MW person 

Many people 


Car-pac-jon shikkhok 

Four-five-MW teacher 

Four to five teachers 


Measuring nouns in Bengali without their corresponding measure words (e.g. \5jf$ at biral instead of 
Ojlotsi f%\5TN at-fa biral "eight cats") would typically be considered ungrammatical. However, when the 
semantic class of the noun is understood from the measure word, the noun is often omitted and only the 
measure word is used, e.g. 'J3^[ 3J1<r>c<l I Shudhu ek-jon thakbe. (lit. "Only one-MW will remain.") 

would be understood to mean "Only one person will remain.", given the semantic class implicit in -tSRT -jon. 

In this sense, all nouns in Bengali, unlike most other Indo-European languages, are similar to mass nouns. 


Verbs 

There are two classes of verbs: finite and non-finite. Non-finite verbs have no inflection for tense or person, 
while finite verbs are fully inflected for person (first, second, third), tense (present, past, future), aspect 
(simple, perfect, progressive), and honour (intimate, familiar, and formal), but not for number. Conditional, 
imperative, and other special inflections for mood can replace the tense and aspect suffixes. The number of 
inflections on many verb roots can total more than 200. 

Inflectional suffixes in the morphology of Bengali vary from region to region, along with minor differences 
in syntax . 

Bengali differs from most Indo-Aryan Languages in the zero copula, where the copula or connective be is 
often missing in the present tenseJ 61 ^ Thus, "he is a teacher" is CN f^l , ^<r> se shikkhok, (literally "he 
teacher")J 85 ' In this respect, Bengali is similar to Russian and Hungarian. Romani grammar is also the 
closest to Bengali grammarJ 84 ' 


Vocabulary 




Bengali has as many as 100,000 separate words, of which 50,000 
are considered Tadbhavas, 21,100 are Tatsamas and the 
remainder loanwords from Austroasiatic and other foreign 
languages. 

However, these figures do not take into account the large 
proportion of archaic or highly technical words that are very 
rarely used. Furthermore, different dialects use more Persian and 
Arabic vocabulary especially in different areas of Bangladesh 
and Muslim majority areas of West Bengal. Hindus, on the other 
hand, use more Sanskrit vocabulary than Muslims. While 
standard Bengali is based on the Nadia dialect spoken in the 
Hindu majority states of West Bengal, about 90% of Bengalis in 
Bangladesh (ca. 148 million) and 27% of Bengalis in West 
Bengal and 10% in Assam (ca. 36 million) are Muslim and 
speak a more "persio-arabised" version of Bengali instead of the 
more Sanskrit influenced Standard Nadia dialect. The productive 
vocabulary used in modern literary works, in fact, is made up 
mostly (67%) of tadbhavas, while tatsamas comprise only 25% 
of the totalJ 85 ^ 86 ^ Loanwords from non-Indic languages 
comprise the remaining 8% of the vocabulary used in modern 
Bengali literature. 

According to Suniti Kumar Chatterji, dictionaries from the early 20th century attributed about 50% of the 
Bengali vocabulary to native words (i.e., naturally modified Prakrit words, corrupted forms of Aryan words, 
and non Indo-European languages. About 45% percent of Bengali words are unmodified Sanskrit, and the 
remaining words are from foreign languages J 87 ^ Dominant in the last group was Persian, which was also the 
source of some grammatical forms. More recent studies suggest that the use of native and foreign words has 
been increasing, mainly because of the preference of Bengali speakers for the colloquial style.^ Because of 
centuries of contact with Europeans, Turkic peoples, and Persians, Bengali has absorbed numerous words 
from foreign languages, often totally integrating these borrowings into the core vocabulary. 

The most common borrowings from foreign languages come from three different kinds of contact. After 
close contact with several indigenous Austroasiatic languages/ 88 ^ 89 ^ 90 ^ 91 ^ and later the Mughal invasion 
whose court language was Persian, numerous Chagatai, Arabic, and Persian words were absorbed into the 
lexiconJ 3 ^ 

Later, East Asian travellers and lately European colonialism brought words from Portuguese, French, Dutch, 
and most significantly English during the colonial period. 

Sample text 




Sources of modern literary Bengali words 
[ 1 67% native 

| 1 25% Sanskrit reborrowings 

| ~ 1 8% indigenous and foreign loans 


The following is a sample text in Bengali of Article 1 of the Universal Declaration of Human Rights: 

Bengali in the Bengali alphabet 

h: nns nrgn wm hnett ter xspjtsr'T i 

3j\5aR i£K35 'SJMTTd tel ^E?TT ^teI 

Bengali in phonetic Romanization 

Dhara ek: Somosto manush shadhinbhabe soman morjada ebong odhikar niye jonmogrohon 
kore. Tader bibek ebong buddhi achhe; sutorang sokoleri eke oporer proti bhratrittosulobh 

































monobhab niye achoron kora uchit. 

Bengali in the International Phonetic Alphabet 

d h ara sk | Jomosto manuj Jad h inb h abe Joman mo rd 3 ada ebor) od h ikar niee 
d 3 onmogrohon ko re | tader bibek ebor) bud h : i atj h e | Jutorar) Jokoleri ske oporer proti 
b h ratrit:oJubb h monob h ab niee atjoron kora utjit 


Gloss 


Clause 1: All human free-manner-in equal dignity and right taken birth-take do. Their reason 
and intelligence exist; therefore everyone-indeed one another's towards brotherhood-ly 
attitude taken conduct do should. 

Translation 

Article 1: All human beings are born free and equal in dignity and rights. They possess 
conscience and reason. Therefore, everyone should act in a spirit of brotherhood towards 
each other. 


See also 


■ Bangla Academy 

■ Bengali dialects 

■ Bengali numerals 

■ Bengali-language newspapers 

■ Chittagonian language 

■ Languages of Bangladesh 

■ Rangpuri l anguage 

■ Romani people 

■ Sylheti language 
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